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We present recent results of the NNPDF collaboration on a full DIS analysis of Parton Dis- 
tribution Functions (PDFs). Our method is based on the idea of combining a Monte Carlo 
sampling of the probability measure in the space of PDFs with the use of neural networks 
as unbiased universal interpolating functions. The general structure of the project and the 
features of the fit are described and compared to those of the traditional approaches. 

1 Introduction 

Experimental uncertainties in hadronic colliders are decreasing to a level where the most careful 
consideration has to be given to uncertainties in theoretical predictions. For hadronic processes 
involving high virtualities, the cross section can be written as a convolution of a partonic cross 
section and process-independent non perturbative functions, Parton Distribution Functions. The 
latter cannot be derived from perturbative QCD, however their evolution is. Consequently we 
can extract PDFs from one experiment and use them as a theoretical input for another. These 
are a key element for any phenomenological prediction and therefore a good knowledge of PDFs 
and of their errors is fundamental^. 

Determining a set of functions with errors from a finite set of data points is far from being 
straightforward because of the many theoretical, experimental and phenomenological complica- 
tions. The correct definition of an error band in the space of functions requires one to build 
up a probability density within that space. Let J-[f a (x)] be an observable depending on a PDF 
f a (x); then its average value is defined as 

{T[f a ]) = J [Vf a ]V[f a (x)}T[f a (x)}, (1) 

where V is the probability density that one aims to determine. However this problem cannot 
be solved without introducing some theoretical assumptions, given that one is trying to infer an 
infinite amount of information from a finite number of points. The traditional solution consists in 
choosing a specific functional form such that the infinite-dimensional space of functions reduces 
to a finite-dimensional space of parameters which determine the chosen parametrisation. However 
the standard approaches I^MZI suffer of several drawbacks, mainly due to the lack of control on 
the bias introduced the choice of a specific parametrisation and, more subtly, to the difficulty in 
providing a consistent statistical interpretation of their PDFs uncertainties when dealing with 
non-gaussian uncertainties and incompatible data. 



2 NNPDF approach: main ingredients 



The difficulties mentioned in the previous section have stimulated various proposals for new ap- 
proaches. The method proposed by the NNPDF collaboration was first tested on the parametri- 
sation of DIS structure function data^H Subsequently it was applied to the determination of a 
single non-singlet parton distribution^. 
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Figure 1: Schematic representation of a multilayer feedforward neural network. 

Schematically one might divide the procedure typical of this approach into three main steps: 
first one generates N vep Monte Carlo sets of artificial data about the original ./Vdat experimental 
points, distributed according to a multi- Gaussian distribution whose spread is determined by 
experimental errors. With 0(1000) replicas the artificial sets reproduce central values, errors 
and correlations of the original data set within one per mil accuracy. Then for each replica one or 
more neural networks are trained in order to fit the artificial data; each neural network provides 
a parametrisation of one independent PDF at some fixed initial scale. The output of the neural 
network must be evolved to the scale of the experimental points in order to be compared to the 
data and in this way the parameters of the networks are fitted. Finally the set of iV re p trained 
neural networks provides a discrete representation of the probability density. Therefore Eq. [T] 
reduces to an average over the ensemble of replicas, 

, N iep 

(Hfa(x)]) = — J2Hf { a k)inCt \x)]. (2) 
JVre P k=l 

One ends up with a statistically accurate Monte Carlo determination of errors: the errors of 
PDFs or more generally of any observable depending on them is given by 

°nu(*)] = M/«W1 2 ) - (3) 

in the same way one is able to evaluate any other statistical estimator. 

It is important to outline that neural networks, Fig. [TJ provide nothing but a redundant 
and unbiased parametrisation for the PDFs at the initial scale: they are approximants whose 
functional form adapts well to a noisy and incomplete set of data. Moreover, having constructed 
a Monte Carlo ensemble of replicas, it is easy to perform a variety of tests in order to assess 
the statistical consistency of our results as well as their independence on the details of the 
fitting procedure, in primis on the choice of parametrisation. The flexibility of neural networks 
architecture is particularly suitable to this kind of systematic analysis^. In principle, any other 
redundant parametrisation with the same features would be suitable. 

More specifically the redundancy is the essential feature of a parametrisation which ensures 
that one ends up with results minimally biased by the choice of the functional form. On the 
other hand the absolute minimum of the x 2 m a redundant parameter space would correspond 
to an over-learning of data, i.e. a regime where the parametrisation adapts not only to the 
physical behaviour but also to statistical fluctuations. Hence we need a criterion in order to 
stop the fit once having learnt the physical behaviour but before over-learning the data. To do 
this, for each replica we divide randomly the data into two sets: training and validation. The 
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Figure 2: Training and validation \ 2 as a function of number of iterations for a given replica (left). Detail of the 

stopping region (right). 



first set is the one on which we actually perform the fit by minimising the fully correlated x 2 ; the 
latter is also evaluated over the validation set at each iteration of the minimisation. When the 
training x 2 is still decreasing and the validation one starts increasing, as it is shown in Fig. [2] 
we are entering in the over-learning region and therefore the fit must be stopped. 

3 DIS analysis 

The extension of the 11011 sing let fiP to a full DIS analysis is the upcoming result of the NNPDF 
collaboration. This extension involves the significant complication of going from one PDF and 
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Figure 3: Kinematic range of experiments included in the fit. 



one observable to a set of parton distributions extracted from many different experiments and 
observables. DIS and neutrino data, structure functions and reduced cross sections are included, 
as shown in Fig. [3j For this first fit we have used a smaller set of PDFs by imposing some 
flavor assumptions like the symmetric strange sea s(x) = s(x) and the proportionality between 
strange and non-strange seas s(x) = ^(u(x) + d(x)) ^] With these assumptions we parametrise 
four combinations of quarks and the gluon distribution at a fixed initial scale (Qq = 2 GeV 2 ) by 
mean of five multilayer neural networks: V(x) = (u v + d v )(x), T$(x) = (u + u — d — d)(x) , 

A.s(x) = (d — u)(x), g{x). The overall normalisation of the nets is fixed by imposing momentum 



a C ~ 0.5 as suggested by dimuon data 



and valence sum rules. In the present code we perform a hybrid N-space and x-space Next-to- 
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Figure 4: Gluon at Q 2 , — 2GeV 2 (left). Reduced neutrino cross section at Q 2 = 12GeV 2 (right). Preliminary 

results. 

Leading Order evolution; the solution of the DGLAP equations is calculated analytically into 
the Mellin space, using a Zero-Mass Variable Flavour Number scheme and including Target Mass 
Corrections. Then it is inverted back into the x space and convoluted with the initial x-space 
parton distributions provided by the nets. 

From our preliminary results, Fig. [4] we see that while in the data region the different 
approaches are compatible, in the extrapolation region they do deviate from each other. Inter- 
estingly our fit produces results consistent with those obtained by the other collaborations0HZl 
and our error bands tend to get bigger in the region where data do not constrain PDFs behaviour. 
These results suggest that standard approaches to PDFs fitting might lead to an underestima- 
tion of errors associated with parton densities and that our combination of MC techniques and 
neural networks is a feasible alternative as well as adequate for detailed statistical studies. Hav- 
ing published the full DIS fit, the next step will be the inclusion of more data sets and some 
phenomenological studies on the impact of our error bands on the main LHC observables. 
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